expandFMINIMUMNUM_FMAXIMUMNUM: Quiet is not needed for NaN vs NaN #139237

wzssyqa · 2025-05-09T10:00:02Z

New LangRef doesn't requires quieting for NaN vs NaN, aka the result may be sNaN for sNaN vs NaN.
See: #139228

llvmbot · 2025-05-09T10:00:37Z

@llvm/pr-subscribers-backend-amdgpu
@llvm/pr-subscribers-backend-x86

@llvm/pr-subscribers-llvm-selectiondag

Author: YunQiang Su (wzssyqa)

Changes

New LangRef doesn't requires quieting for NaN vs NaN, aka the result may be sNaN for sNaN vs NaN.
See: #139228

Full diff: https://github.com/llvm/llvm-project/pull/139237.diff

1 Files Affected:

(modified) llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp (-5)

diff --git a/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp b/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
index ba34c72156228..2c63c54fc03f7 100644
--- a/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
@@ -8683,11 +8683,6 @@ SDValue TargetLowering::expandFMINIMUMNUM_FMAXIMUMNUM(SDNode *Node,
 
   SDValue MinMax =
       DAG.getSelectCC(DL, LHS, RHS, LHS, RHS, IsMax ? ISD::SETGT : ISD::SETLT);
-  // If MinMax is NaN, let's quiet it.
-  if (!Flags.hasNoNaNs() && !DAG.isKnownNeverNaN(LHS) &&
-      !DAG.isKnownNeverNaN(RHS)) {
-    MinMax = DAG.getNode(ISD::FCANONICALIZE, DL, VT, MinMax, Flags);
-  }
 
   // Fixup signed zero behavior.
   if (Options.NoSignedZerosFPMath || Flags.hasNoSignedZeros() ||

arsenm · 2025-05-13T15:31:55Z

Missing test changes

wzssyqa · 2025-05-15T03:31:39Z

OK to merge? @arsenm

wzssyqa · 2025-05-20T04:08:28Z

@arsenm ping

llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp

llvm/test/CodeGen/Mips/fp-maximumnum-minimumnum.ll

arsenm · 2025-05-23T08:19:29Z

llvm/test/CodeGen/AMDGPU/fmin3-minimumnum.ll

 ; GFX8-NEXT:    s_movk_i32 s4, 0x8000
-; GFX8-NEXT:    v_lshrrev_b32_e32 v4, 16, v3
+; GFX8-NEXT:    v_cndmask_b32_e32 v3, v1, v0, vcc


I'm surprised how big the diff is here. I'm also surprised AMDGPU is going through this path for any type, bfloat should have promoted to float?

I cannot find the code that set FMAXIMUMNUM/FMINIMUMNUM to promoted for BF16 for AMD64.
I guess that it is another bug.

diff --git a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp index ade88a16193b..5e4bd36f96d0 100644 --- a/llvm/lib/Target/AMDGPU/SIISelLowering.cpp +++ b/llvm/lib/Target/AMDGPU/SIISelLowering.cpp @@ -213,7 +213,7 @@ SITargetLowering::SITargetLowering(const TargetMachine &TM, ISD::FLOG10, ISD::FEXP, ISD::FEXP2, ISD::FEXP10, ISD::FCEIL, ISD::FTRUNC, ISD::FRINT, ISD::FNEARBYINT, ISD::FROUND, ISD::FROUNDEVEN, ISD::FFLOOR, ISD::FCANONICALIZE, - ISD::SETCC}) { + ISD::SETCC, ISD::FMAXIMUMNUM,ISD::FMINIMUMNUM}) { // FIXME: The promoted to type shouldn't need to be explicit setOperationAction(Opc, MVT::bf16, Promote); AddPromotedToType(Opc, MVT::bf16, MVT::f32); @@ -776,6 +776,10 @@ SITargetLowering::SITargetLowering(const TargetMachine &TM, Vec16, Custom); setOperationAction(ISD::INSERT_VECTOR_ELT, Vec16, Expand); } + for (MVT Vec16 : + {MVT::v2bf16, MVT::v4bf16, MVT::v8bf16, MVT::v16bf16, MVT::v32bf16}) { + setOperationAction({ISD::FMAXIMUMNUM, ISD::FMINIMUMNUM}, Vec16, Promote); + } } if (Subtarget->hasVOP3PInsts()) {

I have a try with this patch. It seems making some difference. Since I don't understand AMDGPU well, I don't know whether it is correct.

We will use it to be sure that the canonicalize is removed in llvm#139237

We will use it to be sure that the canonicalize is removed in #139237

… (#141218) We will use it to be sure that the canonicalize is removed in llvm/llvm-project#139237

New LangRef doesn't requires quieting for NaN vs NaN, aka the result may be sNaN for sNaN vs NaN. See: llvm#139228

wzssyqa requested a review from arsenm May 9, 2025 10:00

llvmbot added the llvm:SelectionDAG SelectionDAGISel as well label May 9, 2025

arsenm added the floating-point Floating-point math label May 13, 2025

wzssyqa force-pushed the fix-phasing-minimumnum-maximumnum branch from 06f867c to 8003083 Compare May 14, 2025 02:25

llvmbot added backend:X86 backend:AMDGPU labels May 14, 2025

wzssyqa force-pushed the fix-phasing-minimumnum-maximumnum branch from 12c5e7c to 8e77aa2 Compare May 23, 2025 02:55

arsenm reviewed May 23, 2025

View reviewed changes

wzssyqa added a commit to wzssyqa/llvm-project that referenced this pull request May 23, 2025

MIPS: Add 64r2 test to CodeGen/fp-maximumnum-minimumnum.ll

45d28ff

We will use it to be sure that the canonicalize is removed in llvm#139237

wzssyqa mentioned this pull request May 23, 2025

MIPS: Add 64r2 test to CodeGen/fp-maximumnum-minimumnum.ll #141218

Merged

wzssyqa added a commit that referenced this pull request May 24, 2025

MIPS: Add 64r2 test to CodeGen/fp-maximumnum-minimumnum.ll (#141218)

a6ca703

We will use it to be sure that the canonicalize is removed in #139237

llvm-sync bot pushed a commit to arm/arm-toolchain that referenced this pull request May 24, 2025

Automerge: MIPS: Add 64r2 test to CodeGen/fp-maximumnum-minimumnum.ll…

1aa6e92

… (#141218) We will use it to be sure that the canonicalize is removed in llvm/llvm-project#139237

wzssyqa added 3 commits May 26, 2025 08:52

expandFMINIMUMNUM_FMAXIMUMNUM: Quiet is not needed for NaN vs NaN

bcd7526

New LangRef doesn't requires quieting for NaN vs NaN, aka the result may be sNaN for sNaN vs NaN. See: llvm#139228

Update testcase

06b5eb8

Update AMDGPU testcase

20e5fa0

wzssyqa force-pushed the fix-phasing-minimumnum-maximumnum branch from 8e77aa2 to 20e5fa0 Compare May 26, 2025 00:53

wzssyqa requested a review from arsenm May 26, 2025 01:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

expandFMINIMUMNUM_FMAXIMUMNUM: Quiet is not needed for NaN vs NaN #139237

expandFMINIMUMNUM_FMAXIMUMNUM: Quiet is not needed for NaN vs NaN #139237

Uh oh!

wzssyqa commented May 9, 2025

Uh oh!

llvmbot commented May 9, 2025 •

edited

Loading

Uh oh!

arsenm commented May 13, 2025

Uh oh!

wzssyqa commented May 15, 2025

Uh oh!

wzssyqa commented May 20, 2025

Uh oh!

Uh oh!

Uh oh!

arsenm May 23, 2025

Uh oh!

wzssyqa May 26, 2025

Uh oh!

wzssyqa May 26, 2025

Uh oh!

Uh oh!

expandFMINIMUMNUM_FMAXIMUMNUM: Quiet is not needed for NaN vs NaN #139237

Are you sure you want to change the base?

expandFMINIMUMNUM_FMAXIMUMNUM: Quiet is not needed for NaN vs NaN #139237

Uh oh!

Conversation

wzssyqa commented May 9, 2025

Uh oh!

llvmbot commented May 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

arsenm commented May 13, 2025

Uh oh!

wzssyqa commented May 15, 2025

Uh oh!

wzssyqa commented May 20, 2025

Uh oh!

Uh oh!

Uh oh!

arsenm May 23, 2025

Choose a reason for hiding this comment

Uh oh!

wzssyqa May 26, 2025

Choose a reason for hiding this comment

Uh oh!

wzssyqa May 26, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

llvmbot commented May 9, 2025 •

edited

Loading